Challenges You Will Face When Parsing PDFs with Python
theseattledataguy.comยท2hยท
Discuss: Hacker News
๐Ÿ“„PDF Archaeology
Ancient Scripts, Modern AI: Bridging the Divide with Morphology-Aware Tokenization by Arvind Sundararajan
dev.toยท1dยท
Discuss: DEV
๐Ÿ“Concrete Syntax
Fastest copy
forums.anandtech.comยท49m
๐Ÿ“„Document Digitization
Semantic Dictionary Encoding
falvotech.comยท2hยท
Discuss: Hacker News
๐ŸŒ€Brotli Dictionary
Lessons from using AI in Discovery
thoughtbot.comยท17h
๐Ÿ•ต๏ธMetadata Mining
WorldCat Editions and Holdings Release
annas-archive.orgยท1dยท
Discuss: Hacker News
๐Ÿ“šMARC Records
Converting a PDF to text locally with Ollama
huijzer.xyzยท2d
๐Ÿ‘๏ธOCR Verification
Mike Driscoll: Erys โ€“ A TUI for Jupyter Notebooks
blog.pythonlibrary.orgยท4h
๐Ÿ“บTerminal UI
DCP-o-matic โ€ข Re: Color Space and Gamma when exporting From Davinci Resolve
dcpomatic.comยท2d
๐Ÿ–ผ๏ธJPEG XL
Show HN: Semlib โ€“ Semantic Data Processing
github.comยท3hยท
Discuss: Hacker News
๐ŸŒณIncremental Parsing
UTF-8 Is Beautiful
hackaday.comยท12h
๐Ÿ”ฃUnicode
Generating Consistent Illustrations with Gemini Image Generation
tinystruggles.comยท1dยท
Discuss: Hacker News
๐ŸŒˆColor Archaeology
Towards an AI-based knowledge assistant for goat farmers based on Retrieval-Augmented Generation
arxiv.orgยท13h
๐Ÿ”Information Retrieval
From Legal Documents to Knowledge Graphs
neo4j.comยท2dยท
Discuss: Hacker News
๐Ÿ“‹Document Grammar
How to Remove Invisible Characters From AI Text (Free Tool)
hackernoon.comยท1d
โœ๏ธOCR Correction
Analyzing Lisp Redux: One Form At a Time
funcall.blogspot.comยท2hยท
๐Ÿ”—Lisp
IETF Draft: Authenticated Transfer Repo and Sync Specification
ietf.orgยท6hยท
Discuss: Hacker News
๐ŸŒณArchive Merkle Trees
Docling: The Document Alchemist
towardsdatascience.comยท3d
๐Ÿ“‹Document Grammars
How to self-host a web font from Google Fonts
blog.velocifyer.comยท2hยท
Discuss: Hacker News
๐Ÿ”คFont Archaeology
LLM Rerankers for RAG: A Practical Guide
fin.aiยท19hยท
๐Ÿ”Information Retrieval